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Abstract 

Group testing tackles the problem of identifying a population of K defective items from a set of n items by 
pooling groups of items efficiently in order to cut down the number of tests needed. The result of a test for a group 
of items is positive if any of the items in the group is defective and negative otherwise. The goal is to judiciously 
group subsets of items such that defective items can be reliably recovered using the minimum number of tests, while 
also having a low-complexity decoding procedure. 

We describe SAFFRON (Sparse-grAph codes Framework For gROup testiNg), a non-adaptive group testing 
paradigm that recovers at least a (1 — e)-fraction (for any arbitrarily small e > 0 ) of iT defective items with high 
probability with m = 6 C(e)iTlog 2 n tests, where C(e) is a precisely characterized constant that depends only 
on e. For instance, it can provably recover at least (1 — 10~^)K defective items with m ~ 68 iTlog 2 n tests. 
The computational complexity of the decoding algorithm of SAFFRON is 0{K \ogn), which is order-optimal. 
Further, we describe a systematic methodology to robustify SAFFRON such that it can reliably recover the set of K 
defective items even in the presence of erroneous or noisy test results. We also propose Singleton-Only-SAFFRON, 
a variant of SAFFRON, that recovers all the K defective items with m = 2e(l + a)K log K\og 2 n tests with 
probability 1 — O (), where a > 0 is a constant. By leveraging powerful design and analysis tools from modern 
sparse-graph coding theory, SAFFRON is the first approach to reliable, large-scale probabilistic group testing that 
offers both precisely characterizable number of tests needed (down to the constants) together with order-optimal 
decoding complexity. 

Extensive simulation results are provided to validate the tight agreement between theory and practice. As a 
concrete example, we simulate a case where K = 128 defective items have to be recovered from a population of 
n ~ 4.3 X 10® items even when up to 2% of the group test results are reported wrongly (i.e., negative tests are 
reported as positive and vice versa). We run the robustified SAFFRON 1000 times. We observe that all K = 128 
defective items are successfully recovered in every run with m cx 8.3 x 10® tests, and the decoding time takes only 
about 3.8 seconds on average on a laptop with a 2 GHz Intel Core i7 and 8 GB memory. 

1 Introduction 

Group testing tackles the problem of identifying a population of K defective items from a set of n items by pooling 
groups of items efficiently in order to cut down the number of tests needed. The result of a test for a group of items 
is positive if any of the items in the group is defective and negative otherwise. The goal is to judiciously group 
subsets of items such that defective items can be reliably recovered using the minimum number of tests, while also 
having a low-complexity decoding procedure. 

Group testing has been studied extensively in the literature. Group testing arose during the Second World 
War HI: in order to detect all soldiers infected with the syphilis virus without needing to test them individually, 
which was too expensive and slow, the blood samples of subsets of soldiers were pooled together and tested as 
groups, so that groups that tested negative would immediately exonerate all the individuals in the group. Since 
then, varied theoretical aspects of group testing have been studied, and more applications of group testing have 
been discovered in a variety of fields spanning across biology d, machine learning d, medicine a, computer 
science Q, data analysis ffi), and signal processing d. 
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1.1 Our Contributions 


The problem of group testing has been a very active area of research, and many variants have been studied in the 
literature. Despite the long history of group testing, our paper has some novel intellectual and practical contributions 
to the field. Our paper falls specifically into the well-studied category of large-scale probabilistic group testing 
where both the ambient test population size and the number of defective items are scalable, and where a targeted 
arbitrarily-tiny fraction of defective items can be missed. 

In this work, we introduce SAFFRON (Sparse-grAph codes Framework For gROup testiNg), a powerful frame¬ 
work for non-adaptive group testing based on modern sparse-graph coding theory 0 . Our main intellectual con¬ 
tribution is that we are, to the best of our knowledge, the first to leverage the tools of sparse-graph coding theory 
for both group testing code design, and performance analysis, based on powerful density evolution techniques. 
Recall that sparse-graph codes (e.g. Low-Density-Parity-Check (LDPC) codes 0) form the backbone of reliable 
modern communication systems (e.g. telecommunications, wireless cellular systems, satellite and deep-space com¬ 
munications, etc.). However, pooling test design for our targeted group testing problem seems quite different from 
code design for the noisy communication problem, and it is not clear if the tools of the latter are even applicable 
here. Concretely, classical coding theory deals with the design of codes based on finite-field arithmetic: e.g., in the 
binary-field case, this corresponds to the modulo-2 or XOR world. In contrast, group testing deals with the Boolean 
OR world, where each observed test output is the binary OR of the states of each of the component input items in the 
test. The non-linearity of the OR operator is at odds with finite-field arithmetic, and complicates the use of classical 
coding theory in the group testing problem. Our main intellectual contribution is to show how this challenge can 
be overcome, where we show how elegant density evolution methods and simple randomized sparse-graph coding 
designs can lead to powerful group test codes as well. 

A second contribution, which follows from the use of these powerful coding theory tools, is that we are able 
to specify precise constants in the number of tests needed while simultaneously having provable performance 
guarantees and order-optimal decoding complexity in the large-scale probabilistic group testing setting. To the best 
of our knowledge, this is new. 

We summarize our main contributions as follows. 

(i) The SAFFRON scheme recovers, with high probability, an arbitrarily-close-to-one fraction, 1 — e, of the 
defective items with m = C{e)Klog 2 n tests, where C'(e) is a constant that depends only on e and can 
be precisely computed (See Table |2]i. Moreover, the computational complexity of our decoding algorithm is 
0{K log n), which is order-optimal. For instance, SAFFRON reliably recovers at least (1 — 10~^)K defective 
items with m ~ 68 iTlog 2 n tests. 

(ii) We propose a variant of the SAFFRON scheme, Singleton-Only-SAFFRON, which recovers all the defective 
items with high probability, 1 —at the cost of m = 2e(l-|-a)Ariog Ariog 2 ntests. The computational 
complexity of the decoding algorithm is 0{K log K log n). 

(iii) SAFFRON and its variant can be systematically robustified to noise and errors by increasing the number of 
tests by a constant factor that does not change the order-complexity of the scheme. 

Extensive simulation results are provided to validate the tight agreement between theory and practice. As a 
concrete example, we simulate a case where K = 128 defective items have to be recovered from a population of 
n ~ 4.3 X 10® items with 2% errors in test results. We run the robustified SAFFRON 1000 times. We observe that 
all K = 128 defective items are successfully recovered in every run with m cx 8.3 x 10® tests, and the decoding 
time takes only about 3.8 seconds on average on a laptop with a 2 GHz Intel Core i7 and 8 GB memory. 

1.2 Related Works 

We provide a brief survey of the existing results in the literature. We refer the readers to ||2]|9]|T0l for a detailed 
survey. 

We first summarize the known results on the minimum number of tests required to solve a (non-adaptive) 
group testing problem. For group testing algorithms with zero-error reconstruction, the best known lower bound 
on the number of required tests is H( j^j^ logn) 011II12I . The best known group testing scheme under this setup 
requires 0{K^ logn) tests 0. Although the zero-error reconstruction property is definitely a desired property, 
such group testing schemes typically involve exhaustive table searches in their reconstruction procedures, and 
hence require a high computational and memory complexity of 0{K‘^n\ogn) ifTSl . The notable exception is a 
recent work m, which is the first scheme that requires O(AT® logn) tests, while having an efficient decoding 
algorithm of computational complexity poly(Ar) ■ 0{K^ log n log^ (AT® log n)) -f 0{K'^ log® n). 

Several relaxations of the group testing problem have been studied in the literature. One such relaxation is 
allowing a small error probability as well as relaxing the requirement of perfect identification. That is, the goal is 
to design a group testing scheme that identifies an approximate answer with high probability. Many approaches 
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have been proposed to design group testing schemes that allow an efficient decoding algorithm for these relaxed yet 
important problems. One such approach is random pooling design based on random bipartite graphs. For instance, 
randomized group testing schemes based on left-regular random bipartite graphs and right-regular random bipartite 
graphs are studied, respectively in ifTSll and ifT^ . Other lines of work have made use of existing pooling designs. 
In ifOlflTl , the authors use randomly chosen pools from carefully designed pools that are initially tailored for 
a zero-error reconstruction setting. With certain success probabilities, these schemes find a large fraction of the 
K defective items, while wrongly identifying a small fraction of normal items as defective items. Despite the 
simplicity of this class of constructions, performance analysis is rather convoluted and cumbersome. Further, these 
schemes are generally difficult to make robust to noise. 

Another line of work is based on an information-theoretic formulation of the group testing problem. That is, one 
assumes a prior distribution on the set of defective items, and searches for a group testing scheme with vanishing 
error probability. In ifTSl . the authors present information-theoretic bounds: with a uniform prior over the location 
of K defective items, 0(iFlog(-g^)) tests are necessary and sufficient (via random coding). Mazumdar presents 
near-optimal explicit constructions in IfTOl . Chan et al. propose novel group testing algorithms and compare their 
performances with the fundamental lower bounds im. Several works have proposed group testing schemes with 
efficient decoding algorithms. In IQ, the authors present a simple group testing procedure that efficiently recovers 
a large fraction of K defective items with 0{K log^ n) tests. While the proposed decoding algorithm runs in 
time {Klogn)‘^^^\ the algorithm returns 0{K \ogn) false positives, which need to be double-checked using a 
2-stage algorithm. One of the most notable exceptions is IMI . In this work, Cai et al. propose GROTESQUE, 
a class of efficient group testing schemes, which is the first adaptive group testing algorithm that achieves both 
an order-optimal number of tests and an order-optimal decoding complexity, but at the cost of using ©(log AT) 
adaptive stages. Their non-adaptive scheme requires 0{K log K\ogn) tests, and has a decoding complexity is 
0{K (log n + log^ K)). 

1.3 Paper Organization 

The rest of the paper is organized as follows. In Section |2] we formally define the problem, and provide the basic 
ideas, based on which we develop SAEERON. In Section[2 we provide a detailed description of SAEERON and its 
decoding algorithm. In Section IH we provide the main theoretical results of the paper. In Section |5] we robustify 
SAEERON so that it can reliably recover the defective items even with some erroneous test results. Einally, we 
provide extensive simulation results in Section |6] verifying our theoretical guarantees as well as demonstrating the 
practical performance of SAEERON. 

2 Group Testing Problem and Overview of the Main Results 

We formally define the group testing problem as follows. Consider a group testing problem with n items. Among 
them, exactly K items are defective. We define the support vector x G {0,1}", of which the z* component is 1 if 
and only if item i is defective. That is, Xi = Ijitem i is defective} for 1 < i < n. Defining supp (•) as the set of 
indices of non-zero elements, |supp (x) | = K. 

A subset of items can be pooled and tested, and the test result is either 1 (positive) if any of the items in the 
subset is defective, or 0 (negative) otherwise. Eor notational simplicity, we denote a subset by a binary row vector, 
a, of which the component is 1 if and only if item i belongs to the subset. Then, a group testing result y can be 
expressed as 


n 

y = (a,x) = Y OiXi, (1) 

i=l 


where V is a boolean OR operator. Let m be the number of pools. Denoting the pool by a^, we define the 
group testing matrix as A = (af, a|’,..., a^)^ G {0,The group testing results from m pools can also be 
represented as a column vector y = (yi, y 2 ,. ■., J/m)^ G {0,1}’", where yi is the outcome of the test. Then, 


y=A©X= 


( (ai,x) ^ 
(a2,x) 


\ (am, x) j 


( 2 ) 


The goal is to design the group testing matrix A and efficiently recover the set of defective items using the m 
test results. We denote the decoding function by qa '■ {0,1}™ —>• {0,1}". We want the decoding result x = gA{y) 
to be close to the true support vector x with some guarantee. 
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Depending on different applications, one can define ‘closeness’ and ‘guarantee’ in different ways. Let us first 
discuss various notions of ‘closeness’. The most stringent objective is exact recovery; x = x. Another objective, 
slightly looser, is partial recovery without false detections: one wants to make sure that set (x) C set (x) and 
I set (x) I > {1 — e)K. Another criterion is to find a superset of all the defective items without missed detections, 
i.e., set (x) D set (x) and |set (x) | < (1 + e)K. 

There are also several types of guarantees. The most stringent guarantee is perfect guarantee: for any x, the 
corresponding estimate has to be close with probability 1. Another popular guarantee is probabilistic guarantee. 
That is, the decoder will provide an exact or close estimate with high probability. 

Our proposed approach, SAFFRON, recovers an arbitrarily-close-to-one fraction of the defective items (par¬ 
tial recovery) with high probability (probabilistic guarantee). On the other hand, we propose the variant of the 
SAFFRON scheme that recovers all the defective items (perfect recovery) with high probability. 

An important generalization of the problem is to find the set of defective items with erroneous or noisy test 
results. We show that SAFFRON can be robustihed to noise, while maintaining its plain architecture. 

The main results of this paper are stated in the following (informal) theorem. 

Theorem 2.1. Consider a group testing problem with n items and K defective items. SAFFRON recovers a (1 — e)- 
fraction of the defective items for arbitrarily-close-to-zero constant e with high probability. The number of tests and 
the computational complexity of the decoding algorithm are 0{K log(n)), which is order-optimal in both noiseless 
and noisy settings. 

Remark 1. In some applications, it is possible to design pools in an adaptive way. In other words, the [i -f 1)* pool, 
a^+i, can be ‘adaptively’ designed after observing the first i test results or (i/i, j/ 2 , ■ • ■, J/i)- Such a scheme is called 
an adaptive group testing scheme. While an adaptive group testing scheme requires a lower number of tests and can 
potentially lead to a more efficient decoding algorithm, our focus here is only on non-adaptive group testing for the 
following reasons. First, adaptive group testing is generally applicable in limited settings of interest, whereas non- 
adaptive group testing applies broadly to any group testing setting. Moreover, non-adaptive group testing enjoys an 
important architecture advantage. In contrast to adaptive group testing algorithms which are necessarily sequential 
in nature, non-adaptive group testing feature pre-determined pools, and can therefore be easily parallelized, leading 
to more efficient implementation, especially in this era of large-scale parallel computing. 

Table [U summarizes our notation, which will be defined throughout the paper. 


Notation 

Definition 

71 

Number of items 

K 

Number of defective items 

m 

Number of pools (tests) 

M 

Number of right nodes (bundles of tests) 

X 

Binary representation of the set of defective items 

y 

Binary representation of the group test results 

Zi 

Measurements from the right node 


Measurements from the j* stage of the i* right node 

U* 

Signature vector of the i* item 

w 

The j* stage of the signature vector of the z* item 

bj 

Binary representation of z — 1 

e* 

The z* standard basis vector 

supp(-) 

The set of the indices of the non-zero elements 

wf) 

Hamming weight or the number of ones of a vector 

q 

Probability of each test outcome being wrong 

Q 

Bipartite graph representation of a sparse-graph code 

X 

Bit-wise complement vector of x 

[n] 

{1,2 ,...,n} 


Table 1: Summary of our notation 


3 The SAFFRON Scheme: Main Idea 

Our test matrix design is based on an architectural philosophy that is similar to the ones in ll2Tlj27l . The key idea 
of SAFFRON is the adoption of a design principle called ‘sparse signal recovery via sparse-graph codes" that 
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is applicable to a varied class of problems such as computing a sparse Fast Fourier Transform, a sparse Walsh 
Hadamard Transform, and the design of systems for compressive sensing and compressive phase-retrieval. In all 
these problems, one designs an efficient way of sensing or measuring an unknown sparse signal such that the 
decoder can estimate the unknown signal with a low decoding complexity. The overarching design principle is to 
1) design a sensing matrix based on a sparse bipartite graph and to 2) decode the observed measurements using a 
simple peeling-like iterative algorithm. We show how this same design principle allows us to efficiently tackle the 
group testing problem. 

Consider a bipartite graph with n left nodes and M right nodes. Here, the n left nodes correspond to the n 
items, and the M right nodes corresponds to the M bundles of test results. We design a bipartite graph based on 
left-regular construction. That is, each left node is connected to constant number d of right nodes uniformly at 
random. 

We denote the incidence matrix of a bipartite graph ^ by Tp G {0,or simply T if ^ is clear from the 
context. Let be the ith row of Tg. We associate each left node with a carefully designed signature (column) 
vector u of length h, i.e., u G {0,1}^. Let us denote the signature vector of item i by u^. We define the signature 
matrix { 7 = [ui,U 2 , ■ ■ ■ ,Un-i,Un] G {0,1}^^". 

Given a graph Q and a signature matrix U, we design our group testing matrix to be a row tensor product 
of Tg and U, which is defined as A = Tg U = [A^, A^, ..., A^\^ G {0, where Ai = [/diag(ti) G 

{0,1}^^", and diag(-) is the diagonal matrix constructed by the input vector. As an example, the row tensor product 
of matrices 


is 


T = 


0 1 0 
1 1 0 
0 0 1 


and U = 


1 0 1 
0 1 1 


■ 0 

0 

0 ■ 

0 

1 

0 

1 

0 

0 

0 

1 

0 

0 

0 

1 

1 - 

o 

0 

1 


(3) 


(4) 


For notational simplicity, we define the observation vector corresponding to right node f as = y(i-i)h+i-.ih for 
1 < i < M. Then, 


Zi = U Q diag(L)x, 1 < i < M. (5) 

In other words, is the bitwise logical ORing of all the signature vectors of the active left nodes that are connected 
to right node i. 

Our decoding algorithm simply iterates through all the right node measurement vectors {zi}f£^, and checks 
whether a right node is resolvable or not. A right node is resolvable if exactly one new defective item can be 
detected by processing the right node, i.e., the location index of the defective item is found. The decoding algorithm 
is terminated when there is no more resolvable right node. 

We now present the following terminologies. A right node that is connected to one and only one defective item 
is called a singleton. A right node that is connected to two defective items is called a doubleton. Later, we show 
that with the aid of our signature matrix, 1) a singleton is resolvable, and 2) a doubleton is resolvable if one of the 
two defective items is already identified (in the previous iterations of the algorithm). 
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3.1 Detecting and Resolving a Singleton 

Consider the following signature matrix where the z* column is a vertical concatenation of bi and its complement, 
where hi is the L-bits binary representation of an integer i — 1, for i € [n], so L = [log2 n]. 0 










■ 0 

0 

0 ... 

1 

1 ■ 









0 

0 

0 ... 

1 

1 









0 

0 

1 ... 

1 

1 

Ui ] 


bi 

b2 

ba • 


byj 


0 

1 

0 ... 

0 

1 

Ui . 


bi 

b2 

ba . 

b/x—i 

byj 


1 

1 

1 ... 

0 

0 









1 

1 

1 ... 

0 

0 









1 

1 

0 ... 

0 

0 









1 

0 

1 ... 

1 

0 


We now show that a singleton can be detected and resolved with the aid of this signature matrix. First, note that the 
sum of the weight of any binary vector and the weight of its complement is always the length of the vector, L. Thus, 
given a singleton, the weight of the measurement vector is L. Furthermore, if the right node is connected to zero or 
more than one defective items, the weight of the measurement vector will not be L. Therefore, by just checking the 
weight of the right-node measurement vector, one can simply detect whether the right node is a singleton or not. 
Further, one can also read the hrst half of the measurement of the detected singleton to find the index location of 
the defective item. 


While having only 


Ui 

Ui 


as the signature matrix suffices for detecting and resolving singletons, in the fol¬ 


lowing subsections, we show that to detect and resolve a doubleton, we need to expand the signature matrix. Thus, 
Ui 


Ui 


will be one part of our hnal signature matrix U. 


3.2 Resolvable Doubletons 


We now design the full signature matrix U by expanding 
and resolvable doubletons as follows. 


Ui 

Ui 


so that one can detect and resolve both singletons 




bi 

b2 

CO 

b„_2 

bn—1 

■ Ui ' 


bi 

b2 

ba ■ 



Ui 

U2 

U2 


bii 

b*2 

b*3 ■ 

• b,„_, 

bi„ 


bii 

b*2 

b*3 ■ 

■ bj„_i 

bin 

U 3 

Us 


bii 

bi2 

bi3 ■ 

• by„_, 

bjn 



L bji 

bi2 

bi3 • 

■ by„_, 

bjn j 


where Si = (ii, Z2, ..., in) and S2 = (ji, J2, ■ ■ ■, Jn) are drawn uniformly at random from the set [n]". Then, the 
measurement vector for the kf right node, z^, is 


Zk = U Q diag(tfe)x = 


Ui ©diag(tfc)x ■ 


’ Zfe ' 

Ui 0diag(tfe)x 


4 

U2 © diag(tfc)x 

def 

4 

U2 ©diag(tfe)x 


4 

C/3 ©diag(tfc)x 


4 

_ C/a © diag(tfe)x _ 


[ 4 \ 


( 8 ) 


We call z^ the w”' section of the k"' right-node measurement vector. Figure[T]provides an illustration of a right-node 
measurement vector z^. Assume the support vector is x = (0,1,1,..., 0,1, 0). That is, there are exactly K — 3 
defective items; item 2, item 3 and item n — 1. Also, assume that the fc* right node is connected to item 2, item 
n — 2 and item n — 1, i.e., = (0,1, 0,..., 1,1, 0). Consider the k"' right node and its corresponding right-node 

*For simplicity, the rest of paper will assume that n is a power of 2, and hence L — log2 n. 
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Figure 1: An illustration of a right-node measurement vector Zfc and its 6 sections. When the support vector x = 
(0,1,1 ,..., 0,1, 0) and the fc* row of the bipartite graph = (0,1,0,, 1,1,0) are given, the fc* right-node measurement 
vector Zk is equal to U2 + u„_i, as depicted in the figure. The right-node measurement vector consists of sections 


measurement vector zj^. By (|8]l, z^ is equal to U2 + u„_i, and it consists of 6 sections. We can also find each 
section of z^ by looking at the corresponding sections of U 2 and u„_i, as described in Figure[T] For instance, the 
first section of the /c* right-node measurement vector z^ is equal to b 2 -I- b„_i. 

Assume that right node k is connected to exactly one identified defective item, say £q. The decoder first guesses 
that the right node is a resolvable doubleton. That is, the right node is connected to exactly two defective items: one 
of them is the identified defective item £o, and the other is the unidentified defective item £i. Thus, 



= Ufo V = 

0 

X! 

V 

O' 


cr 

0 

X! 

> 

2 



= 



L J 

bfo . 




. b^o 

V b^i 


One can always recover any bit of b^j as follows. Consider the first bit of b^^. Ifb^p i = 0, b^i,i = zj , If not, 

one can read the first bit from the second section and invert it, i.e., b^^ i = z'^ Therefore, if the assumption is 
true, the decoder is able to recover the other defective item’s index, £i. Similarly, the decoder applies the decoding 
algorithm to the other four segments from z ^ to z ®, and obtain two other indices, £2 and £3. Finally, the decoder 
checks whether £2 = and whether £3 = jt-^ . If any of these conditions does not hold, the decoder concludes that 
the guess is wrong. If both conditions hold, the decoder concludes that the hypothesis is correct, declares the right 
node to be resolved, and declares a new defective item of index £ 3 . Then, 

Lemma 3.1. SAFFRON successfully detects and resolves all the singletons and all the resolvable doubletons. When 
a right-node measurement vector is connected to more than 2 defective items, SAFFRON declares a wrong defective 
item with probability no greater than 

Proof. From the above decoding algorithm description, it is clear that SAFFRON correctly detects and resolves all 
the singletons and resolvable doubletons. Consider a right node connected to more than 2 defective items. From 
the first two sections of the corresponding right-node measurement vector, SAFFRON reads the first index £ 3 . 
SAFFRON declares a wrong defective item only if item £i is not connected to the right node and the two check 
equations hold. We note that when item £i is not connected to the right node, and are independent of £2 and 
£3. Thus, the probability that £2 = and £3 = ji^ is □ 

Remark 2. Lemma lTTI ensures that the signature matrix U in (fTTT i can be used to detect and resolve all the singletons 
and all the resolvable doubletons with error probability no greater than . Later, we show that SAFFRON performs 
the described right-node decoding algorithm 0{K) times in total. Thus, by the union bound, the probability of 
having any error in 0{K) tests is bounded by O(^), which always approaches zero as n increases. However, if 
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the number of defective items is sublinear in n, to reduce the number of tests, one can keep only the first 4 sections 
in the signature matrix U. By doing so, the probability of having any error in 0{K) tests is upper bounded by 
0{^), which still vanishes since K = o{n). Therefore, one can always use a shorter signature matrix in order to 
save the number of tests by 33%, when the number of defective items is sublinear in n. 

3.3 Example 

In this section, we provide an illustrative example of the decoding algorithm of SAFFRON. 

Consider a group testing problem with n = 8 items and K = 3 defective items. Let x= (1,0,1,0, 0,0, 0,1), 
i.e., item 1, item 3 and item 8 are defective items. We show how SAFFRON can find the set of defective items. 
Recall that we design our group testing matrix A to be a row tensor product of Tg and U. Assume that a bipartite 
graph Q is designed as follows. 0 

■ 0 1 1 1 0 1 0 0 

11110 0 11 

“ 1 0 0 0 1 0 1 1 

0 110 110 1 

We have M = 5 right nodes, and n = 8 items are connected to them according to Tg. 

Assume that we drew random sequences Si and S 2 , and the realization of them are as follows. 

Si = (5,2,4,8,7,1,3,6), S2 = (3,1,5,6,3,8,2,7) (10) 


e {0,l}^^x" 


Thus, the measurement matrix of SAFFRON is as follows. 


Ui 

Ui 

U2 

U2 

Us 

Us 


0 

0 

0 

0 

1 

1 

1 

1 

0 

0 

1 

1 

0 

0 

1 

1 

0 

1 

0 

1 

0 

1 

0 

1 

1 

1 

1 

1 

0 

0 

0 

0 

1 

1 

0 

0 

1 

1 

0 

0 

1 

0 

1 

0 

1 

0 

1 

0 

1 

0 

0 

1 

1 

0 

0 

1 

0 

0 

1 

1 

1 

0 

1 

0 

0 

1 

1 

1 

0 

0 

0 

1 

0 

1 

1 

0 

0 

1 

1 

0 

1 

1 

0 

0 

0 

1 

0 

1 

1 

0 

0 

0 

1 

1 

1 

0 

0 

0 

1 

1 

0 

1 

0 

1 

1 

0 

0 

0 

1 

1 

0 

1 

0 

0 

0 

1 

0 

1 

1 

0 

1 

1 

0 

0 

1 

0 

1 

0 

0 

1 

1 

1 

0 

0 

1 

0 

1 

1 

1 

0 

1 

0 

0 

1 


= [U1,U2,U3,U4,U5,U6,U7,U8] 


( 11 ) 


Using (|5]l, we have the following equations for the 4* right-node measurement vectors. 


( Zi ^ 


/ 

U3 

\ 

Z2 



Ui V U3 V Ug 


Z3 



Ui V Ug 




1 

U3 V Ug 

/ 


Thus, we will observe the following right-node measurement vectors. 


Zi = (0,l,0,l,0,1,0,1,1,1,0,0,1,0,0,0,1,1)^ 

(13) 

Z2 = (1,1, 1,1, 1,1, 1,1, 1,1, 1,1, 1,1,0, 1,1,1)^ 

(14) 

Z3 = (l,l,l,l,l,l,l,0,l,0,l,l,l,l,0,l,0,l)^ 

(15) 

Z4 = (l,l, 1,1,0,1,1,1,1,1,1,0,1,1,0,0,1,1)^ 

(16) 


^ In the interest of conceptual clarity of the toy example, here we present a bipartite graph that is not left-regular. 




















We are now ready to decode these measurements. The decoding algorithm first finds all the singletons by 
checking whether a right-node measurement’s weight is 3L = 3 log 2 n. The weights are as follows. 

w(zi) = 9, w(z2) = 17, w(z3) = 14, w(z4) = 14 (17) 

Since w(zi) = 3 log 2 n, the decoder declares that right node 1 is a singleton. Then, it can read off the first 3 bits of 
zi. As z} = (0,1, 0), the decoder concludes that item 3 is defective. 

In the second iteration, the algorithm inspects right nodes that are potentially resolvable doubletons including 
defective item 3. Since T 2 i = T 4 1 = 1, right nodes 2 and 4 are inspected. 

Consider right node 2. We hypothesize that the right node is a doubleton consisting of defective item 3 and 
exactly one other unknown defective item. That is, we guess that Z 2 = U 3 V u^j, and recover £i,f '2 and £3 as 
described in the previous section. Then, 


£ 3 = 6 , £2 = 8 , £1 = 3 (18) 

By noticing that = *6 = 1 ^ 2 , jii = je = 8 7 ^ € 3 , the decoder declares that the right node is not a resolvable 

double ton that contains item 3. 

Consider right node 4 . The decoder again makes a guess that Z4 = U3 V . Then, it obtains three indices as 
follows. 


£ 1 = 8 , £2 = 6 , 4 = 7 (19) 

By noticing that 4,^ =4 = Q = £ 2 , jii = is = 7 = £ 3 , the decoder declares that right node 4 is a resolvable 
doubleton including item 3. Moreover, it also finds that the other defective item’s index is £i = 8 . 

In the third iteration, the decoder knows that right node 2 is not resolvable anymore as it already includes two 
identified defective items. However, right node 3 now has a possibility of being a resolvable doubleton as the 
decoder found defective item 8 in the previous iteration, and defective item 8 is also in right node 3. The decoder 
hypothesizes that right node 3 is a doubleton, i.e., Z 3 = Ug V Similarly, the decoder reads three indices, and 
the recovered three indices are as follows. 


4 = 1, 4 = 5, 4 = 3 ( 20 ) 

Because 4i = 4 = 5 = £ 2 , ji^ = ji = 3 = £ 3 , the decoder will conclude that right node 3 is a doubleton 
including defective item 8 , and that the other defective item’s index is £i = 1 . 

The algorithm is terminated as there are no more right nodes to be resolved, concluding that items 1, 3 and 8 
are defective items. 

4 Main Results 

In this section, we analyze the SAFFRON scheme. The main theoretical result of this paper is the following 
theorem. 

Theorem 4.1. Withm = 6 C(e)K log 2 n tests, SAFFRON recovers at least {l — e)K defective items with probability 
1~C1(^), where e is an arbitrarily-close-to-zero constant, and C{e) is a constant that depends only on e. Tabled 
shows some pairs of e and C(e). 


Error floor, e 

10"^ 

10"^ 

10"^ 

lo-*^ 

10^ 

10"« 

10-y 

10^ 

C(f) = S 

6.13 

7.88 

9.63 

11.36 

13.10 

14.84 

16.57 

18.30 

Left-deg, d* 

7 

9 

10 

12 

14 

15 

17 

19 


Table 2: Pairs of e and C(e) 

The computational complexity of the decoding algorithm is linear in the number of measurements, i.e., 0{K log n), 
that is order-optimal. 

Proof. First note that each right node is associated with 6 log 2 n tests based on Q. Thus, we only need to show 
that the number of required right nodes to guarantee successful completion of the algorithm is C{e)K. 

We design a d-left-regular bipartite graph with n left nodes and M right nodes as follows. Each left node is 
connected to a set of d right nodes uniformly at random, independently from other left nodes. For the analysis, 
we focus on the pruned bipartite graph constructed by the K defective left nodes and the right nodes. Then, the 
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average right degree A = Further, as K gets large, the degree distribution of right nodes approaches a Poisson 
distribution with parameter A. We define the right edge-degree distribution p{x) = where pi is the 

probability that a randomly selected edge in the graph is connected to a right node of degree i. Then, with this 
design of random bipartite graphs. 


iM j • u j -A^ 

Pi = -Frtdegree of a random right node = i) = -e 

Kd Kd 




vi-l 


z! ® 


( 21 ) 


Hence, p{x) = 

SAFFRON performs an iterative decoding procedure as follows. In the first round, it hnds all the singletons and 
their corresponding defective items as described in Section [TTI In the following rounds, it detects and resolves all 
the resolvable doubletons and recovers their corresponding defective items. This process is repeated until no new 
defective items are recovered during one iteration. 

The fraction of defective items that cannot be identified at the end of this iterative decoding algorithm can be 
analyzed by density evolution ll8l l28ll . Density evolution is a tool to analyze a message-passing algorithm. At 
iteration j of the algorithm, an unidentified defective item passes a message to its neighbor right nodes that it has 
not been recovered. Let pj be the probability that a random defective item is not identified at iteration j. The density 
evolution relates pj to Pj+i as follows. 


Pj+i = Pr(not resolvable from one children right node)'^ ^ = [1 — {pi + p 2 il — Pj))]‘^ ^, (22) 

where pi = e~^, and p 2 = Ae“^. To prove the above equation, consider the graph shown in Figure|2] At iteration 



Figure 2: A tree-like neighborhood of an edge between left node v and right node c. At iteration j + 1, a ‘not-recovered’ 
message is passed through this edge if and only if none of the other neighbors of v, have been identified as either a 

singleton or a resolvable doubleton at iteration j. 

j + 1, left node v passes a ‘not-recovered’ message to right node c if none of its other neighbor right nodes 

has been identified as either a singleton or a resolvable doubleton at iteration j. The probability that a particular 

neighbor right node, say ci, has been resolved (either as a singleton or doubleton) at iteration j is pi -f p 2 (1 — Pj ) ■ 

Then, given a tree-like neighborhood of v, all the messages are independent. Thus, the above equation follows. 

To characterize the fraction of defective items that will not be recovered by the time the algorithm terminates, 
we find the limit of the sequence {pj} as j oo. Let e = limj_^oo Pj- One can approximately calculate 

e = [1 - (Pi + P2)f~^ = [1 - - Ae“^] ^. (23) 

We can also pictorially see how pj evolves as follows. We first draw a vertical line from {pj,pj) to {pj,^{p)) 
and then a horizontal line from {pj,j{pj)) to {'j{pj),^{pj)) = {Pj+i,Pj+i)- We repeat the above procedure until 

Pj converges to a hxed point. Figure[3plots the function ^{p) = [1 — (pi + P 2 (l — p))]^ ^ when A = 1.038 and 
d = 19. Starting frompi = 1, we observe thatp^ converges to e = [l — e~^ — ^ ~ 10“^°. 

We now find a pair of design parameters {d, M) that minimize the number of right nodes M (thus the number 
of tests) given a targeted reliability e. 

{d — 1) log(l — e~^ — Xe~^) = log e. (24) 
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Figure 3; Density evolution. Plotted is the function 'j(p) = [1 — (pi + p 2 (l — with A = 1.038 and d = 19 along with 

y = p. Note that 'y{p) meets the identity function at p = 10“^°. The dash-dotted line shows how {pj} evolves starting from 
Pj = 1 as j increases. 


We solve the following optimization problem numerically. 

Kd 

min M = — (25) 

A>0 A 

subject to (d — 1) log(l — — Xe~^) = log e. (26) 

We numerically solve the optimization problem and attain the optimal A* and d* as a function of e. Some of the 
optimal design parameters for different reliability levels are shown in Table|2] We dehne C(e) = ^ = ^. 

Up to now, we have analyzed the average fraction of unidentihed defective items over a randomly constructed 
bipartite graph. To complete the proof, we need to show three more steps. First, we need to show that after a 
hxed number of iterations, pj gets arbitrarily close to e. Second, we need to show that with high probability, a 
constant-depth neighborhood of a random left node is a tree. Finally, using the steps 1 and 2, we show that the 
actual fraction of unidentihed defective items is highly concentrated around its average. These proofs are identical 
to the ones (Corollary 2.5., Lemma 2.6., Lemma 2.7.) in 1291 . which we omit for the purpose of readability. □ 

4.1 A Variant of SAFFRON: Singleton-Only-SAFFRON 

We now present a variant of the SAFFRON scheme that only detects and resolves singletons. We call this scheme 
Singleton-Only-SAFFRON. Clearly, the decoding algorithm of Singleton-Only-SAFFRON is not iterative; instead, 
it detects and resolves all singletons in a single-stage procedure. We remark that Singleton-Only-SAFFRON is 
closely related to the non-adaptive GROTESQUE, proposed in l20l : both schemes detect and resolve single- 
tons only. However, Singleton-Only-SAEERON requires signihcantly lower number of tests than the non-adaptive 
GROTESQUE. This is because our deterministic signature matrix is more efficient than the random signature ma¬ 
trices. 

We show that using only singletons costs us an extra factor ©(log K) in the number of tests and computational 
complexity. However, the Singleton-Only-SAEERON scheme can recover all the K defective items with high 
probability. 

The measurement matrix of Singleton-Only-SAEERON is similar to the one of SAEERON with the difference 
that the signature matrix only consists of C/i and {7i as stated in (|6ll. The reason is that the algorithm does not intend 
to detect and resolve doubletons, as explained in Section [L2l 

Theorem4.2. Withm = 2e(l-|-a)iTlogAriog2n — 5.437(1-fajATlog ATlog 2 n tests, Singleton-Only-SAFFRON 
finds all the K defective items with probability where e is the base of the natural logarithm, and 

a > 0. The computational complexity of the decoding algorithm is linear in the number of measurements, i.e., 
0{K log KXogn). 

Proof Eirst note that each right node is associated with 2 log 2 n tests based on (|6]l. Thus, we only need to show 
that the number of required right nodes to guarantee successful completion of the algorithm is e(l -f a)K log K. 
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We design a bipartite graph with M = e(l + a)K log K right nodes. Each left node is connected to a certain 
right node with probability P = ^ independently of everything else. Then, the average degree of a right node is 
A = Kp = 1. Note that the probability that a certain defective item is not connected to any singletons is as follows. 


E - e”') + (1 - P))"" = (1 - (27) 

i=0 A * / 

Hence, the probability that any of the K items is not found can be bounded using union bound as follows. 

Pe < K(l-pe-^)^ (28) 

/ (l+a)eK log K 

< K J (29) 

= 7s:-“ (30) 

In ( l29b . we use 1 — a; < e““ for all x. □ 

5 Robustified SAFFRON for Noisy Group Testing 


In this section, we robustify SAFFRON such that it can recover the set of K defective items with erroneous or noisy 
test results. We assume an i.i.d. noise model. That is, each test result is ‘wrong’ with probability q, i.e., 

y = A©x + w, ( 31 ) 

where the addition is over binary field, and w is an i.i.d. noise vector whose components are 1 with probability 
0 < q < -^ and 0 otherwise. 0 

Our approach is simple: we design the robust signature matrix U' consisting of encoded columns of Uj for 
1 < j < 3 and their complements. Intuitively, we treat each column of Uj as a message that needs to be transmitted 
over a noisy memoryless communication channel. An efficient modern error-correcting code guarantees reliable 
decoding of the signature. Spatially-coupled FDPC codes have the following properties JSO). 

• It has an encoding function /(•) : {0,1}^ {0,1}^/^ and a decoding function g{-) : {0,1}-'^/^ — 

{0,1}^, and its decoding complexity is 0{N). 

• If i? satisfies 


i? < 1 - H{q) - 5 = q\og 2 qP (1 - ?) log 2 (1 - g) - (32) 

for an arbitrarily small constant <5 > 0, then Pr( 5 (x w) x) < 2“^^ as N approaches infinity, for some 
constant C > 0. 

Note that each column of Uj is of length log 2 n. Thus, N = log 2 n in this setup. Given such an error-correcting 
code, we design the signature matrix U for the robust SAFFRON scheme as follows. 


g{0,l}^^x- (33) 


We now describe how the robustified SAFFRON detects and resolves a singleton. Consider a singleton right 
node k. Then, the right-node measurement vector is of the following form. 

Zk=Uk+w (34) 

The decoder first applies the decoding function g{-) to the first, third, and fifth segments of the right-node measure¬ 
ment vector and obtains g(zl), g{z^), and g{z\). Fet .fi — 1 be the decimal representation of g{z].), £2 — 1 be the 

^If g > |, one can always take the complement of all the test results, and treat the channel as if each test result is wrong with crossover 
probability 0 <g=l — g<^. 


/(bi) 

/(b2) 

/(ba) . 

.. /(b„-2) 

/(b„-i 

/(bi) 

/(b2) 

/(ba) . 

.. /(b„-2) 

/(b„-i 

/(bn) 

/(bn) 

/(bn) ■ 

/(b*„_J 

/(b.J 

/(bn) 

/(bn) 

/(bn) ■ 

/(b,„_J 

/(b.J 

/(bn) 

/(bn) 

/(bja) ■ 

/(b,„_J 

/(b,J 

. /(bn) 

/(bn) 

/(bn) ■ 

/(bnj 

/(b,J 
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decimal representation of g{z^), and ^3 — 1 be the decimal representation of g{z\). We use the following singleton 
detection rule. 


i(.i = (-2, jti = 


(35) 


If the above conditions are satisfied, the decoder declares a singleton and the location index of the defective item 

h- 

Lemma 5.1. Robustified-SAFFRONmisses a singleton with probability no greater than Robustified-SAFFRON 
wrongly declares a defective item with probability no greater than ^2+c ■ 


Proof Robustified-SAFFRON misses a singleton only if any of the 3 decoded indices is wrong. The probability 
of such an event is upper bounded by ^ by the union bound. Robustified-SAFFRON wrongly declares a defective 
item only if £1 is wrongly decoded, but dTSl l still holds. Such an event happens with probability no greater than 

□ 


1 

72 +^ 


Lemma ISTI implies that the robustified SAFFRON scheme will miss fewer than ;^-fraction of singletons. We 
compensate this loss by increasing the number of right nodes; instead of using M right nodes, we use M (l -f 
right nodes, so that the effective number of right nodes becomes M (l -f w) (1 “ w) ~ M. 

Now, consider right node k that is a resolvable doubleton with an identified defective item £0 and an unidentified 
defective item £1. Then, the right-node measurement vector is of the following form. 


■ /(b^o) ■ 


■ /(b^J ■ 


w) 

/(b^o) 


/(brj 


w? 

/(b*,„) 


fiK, ) 


w? 


V 


+ 


fiKo) 

fiKJ 



fiKo) 


fiKJ 


Wfc 

w? 

1- 

0 

1_ 


_ fiK,) _ 


L fc J 


(36) 


Consider the first two sections of the measurement vector and z^. We show that since the decoder knows 
u^j,, it can get access to the measurement -f w^. To this end, the decoder first looks at the first bit of 
If this bit is 0, then the first bit of z^ is indeed the first bit of /(b^J -f w^. Now, if the first bit of is 1, the 
decoder complements the first bit of z| to get the first bit of f{hi ^) + w|. Similar procedure can be performed for 
all bits. By collecting all the bits, the decoder has access to /(b^J + wj3, where wj3 has the same statistics as the 
original i.i.d. noise. Then, it can apply the decoding function to obtain £1. Similarly, using the other sections of the 
measurement vector, the decoder can recover £2 and £3. Thus, by testing the detection rules in dTSl) . the decoder can 
detect a resolvable doubleton. 

We now present the following theorem for noisy group testing. 


Theorem 5.2. With m = 6l3{q)C{e)K log2 n tests, Robustified-SAFFRON can recover at least (1 — e)K defective 
items with probability 1—), where e is an arbitrarily-close-to-zero constant, C{e) is a constant that depends 
only on e, and /3{q) = ^ > for an arbitrarily small constant (5 > 0. Table^shows some pairs of e and 

C{e). The computational complexity of the decoding algorithm is linear in the number of tests, i.e., 0{K\ogn). 


5.1 The Robustified Singleton-Only-SAFFRON 

The Singleton-Only-SAFFRON scheme can also be robustified in a similar manner. We design the signature matrix 


U for the robustified SAFFRON scheme as follows. 




- /(bi) 

/(b2) 

/(ba) .. 

I{hn-2) 

/(bn-l 

U = 

/(b.J 

/(b,J 

/(b.3) 

/(b,„_J 

/(b.J 


- /(b,J 

/(b,J 

/(b.a) 

/(b,„_J 

/(b,J 


3 loeo n 

e {0,1}^?^^’^ 


(37) 


As explained before, by decoding the three sections of the measurement vector and checking whether three decoded 
indices satisfy dTSl l. the decoder can detect and resolve singletons. 

Theorem 5.3. With m = 3e(l -I- a)l3{q)K log Klog 2 n ~ 8.1548(1 -I- a)l3{q)K log K\og 2 n tests, the robus¬ 
tified Singleton-Only-SAFFRON can find all K defective items with probability 1 —where (3{q) = -p > 
arbitrarily small constant 6 > 0 and some constant a > 0. The computational complexity of the 
decoding algorithm is linear in the number of measurements, i.e., 0{K log K log n). 
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Number of right nodes per defective item, M/K 

Figure 4; The average fraction of unidentified defective items obtained via simulations. For different pairs of {d, M), we 

simulated SAFFRON 1000 times and measured the average fraction of unidentified defective items. We set n = 2^®, K = 100, 
d e {3, 5, 7,9} and a: < M < IK. 


6 Simulation results 

In this section, we evaluate the performance of the SAFFRON scheme and the robust SAFFRON scheme via 
extensive simulations. We implement simulators for both schemes in Python and test them on a laptop0. 

6.1 SAFFRON 

The SAFFRON scheme recovers with high probability an arbitrarily-close-to-one fraction of K defective items with 
0(K log n) tests, as stated in Theorem l4.ll The theorem also characterizes the optimal pairs of (d*, A*) for a target 
recovery performance e: as the fraction of unidentified defective items e decreases, the corresponding optimal left- 
degree d* increases. For different pairs of (d, M), we run SAFFRON 1000 times and measure the average fraction 
of unidentified defective items; we choose n = 2^®, K = 100, d G {3, 5, 7, 9} and K < M < IK. Plotted in 
Figure|4]are the average fractions of unidentified defective items obtained via simulations for different values of d. 
As expected, we can observe that if M is close to K, the average fraction of unidentified defective items can be 
minimized by setting d = 3, and if M is close to 7K, higher values of d perform better. 

We now simulate how computationally-efficient SAFFRON’S decoding algorithm is. We measure the average 
runtime of SAFFRON with n = 2®^, while increasing the value of K. In Figure|5al we plot the simulation results, 
and they clearly demonstrate the 0{K) factor of the computational complexity. Similarly, we repeat simulations 
with AT = 2® = 32, while increasing the value of n. In Figure |5b] we plot the average runtime of SAFFRON with 
a logarithmic x-axis; we can clearly observe the Cl(logn) factor of the computational complexity. 

6.2 Robustified-SAFFRON 

We now evaluate the robustified SAFFRON scheme. In our setting, we choose n = 2®^ ~ 4.3 x 10® and AT = 2^ = 
128. For random bipartite graphs, we use d = 12 and M = 11.36Ar. The noise model is the one we described in 
Section|5] We vary the probability of error q from 0.03 to 0.05: a test result is flipped with probability from 3% to 

5%. 

While we made use of capacity-achieving codes in Theorem l5.21 we use Reed-Solomon codes for simulations 
for simplicity ED. A Reed-Solomon code takes a message of Ck symbols from a finite field of size Cq > Cn, for a 
prime power Cq, and then encodes the message into c„ symbols. This code can correct upto any \ J symbol 
errors. By using a field of size Cq = 2®, a binary representation of length L {— log2 n) can be viewed as a 4-symbol 
message, i.e., Ck = 4. Thus, the overall number of tests is as follows: 

m = 11.36 A' X — X 6 log2 n . (38) 

Number of right nodes Number of message bits 

Error-correcting code expansion 

"'We used a laptop with 2 GHz Intel Core i7 and 8 GB memory. 
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Figure 5; Time complexity of SAFFRON. We measure 



(b) Run-time with K = 2^ and varying n. 


run-time of SAFFRON for varying values for n and K. 


By having Cn = Ck + 2.t, the robustified SAFFRON scheme can correct upto t symbol errors within each section of 
the right-node measurement vector. Thus, we evaluate the performance of the robustified SAFFRON scheme with 
c„ G {6,8 ,..., 16} for various noise levels. We measure the average fraction of unidentified defective items over 
1000 runs for each setup. Figure |6] shows the simulation results; the cc-axis is the block-length of the used code, and 
the logarithmic y-axis is the average fraction of unidentified defective items. We can observe that for a higher noise 
level q, the minimum block length required to achieve a certain value of e increases. 

We also observe that the robustified SAFFRON perfectly recovers all defective items even with the presence of 
erroneous test results with certain parameters. For instance, with q — 0.02, we test the robustified SAFFRON 1000 
times with c„ = 12. For all the test cases, it successfully recovers all K — 128 defective items from the population 
of n ~ 4.3 X 10® items with m = 838080 ~ loooo ^ tests. Further, the decoding time takes only about 3.8 seconds 
on average. Similarly, we observe the perfect recovery with q — 0.01 and c„ > 12 and with q = 0.005 and Cn > 6. 

7 Conclusion 

In this paper, we have proposed SAFFRON (Sparse-grAph codes Framework For gROup testiNg), which recovers 
an arbitrarily-close-to-one (1 — e)-fraction of K defective items with high probability with 6C'(e)Ariog2 n tests, 
where C{e) is a relatively small constant that depends only on e. Also, the computational complexity of the decod¬ 
ing algorithm of SAFFRON is order-optimal. We have described the design and analysis of SAFFRON based on 
the powerful modern coding-theoretic tools of sparse-graph coding and density evolution. We have also proposed a 
variant of SAFFRON, Singleton-Only-SAFFRON, which recovers all defective items with 2e(l-|-Q;)Ar log K log2 n 
tests, with probability Further, we robustify SAFFRON and Singleton-Only-SAFFRON by using mod¬ 

ern error-correcting codes so that they can recover the set of defective items with noisy test results. To support 
our theoretical results, we have provided extensive simulation results that validate the theoretical efficacy and the 
practical potential of SAFFRON. 
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